Likelihood Methods for Cluster Dark Energy Surveys 
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Galaxy cluster counts at high redshift, binned into spatial pixels and binned into ranges in an 
observable proxy for mass, contain a wealth of information on both the dark energy equation of state 
and the mass selection function required to extract it. The likelihood of the number counts follows a 
Poisson distribution whose mean fluctuates with the large-scale structure of the universe. We develop 
a joint likelihood method that accounts for these distributions. Maximization of the likelihood over 
a theoretical model that includes both the cosmology and the observable-mass relations allows for 
a joint extraction of dark energy and cluster structural parameters. 



I. INTRODUCTION 

Upcoming high redshift cluster surveys have the po- 
tential to provide precision constraints on the evolution 
of the dark energy density due to the exponential sensi- 
tivity of the abundance of massive dark matter halos to 
the growth of structure (e-g- HQ) The wealth of infor- 
mation contained in even the cluster counts alone par- 
tially offsets the fundamental problem that the masses 
of the clusters are not directly observable. Techniques 
that utilize this information to relate cluster observables 
such as Sunyaev-Zel'dovich (SZ) flux, X-ray temperature 
and surface brightness, or optical richness and shear to 
cluster masses and hence the cosmology are known in the 
literature as "self-calibration" methods 00113 

Extracting the full information from the number 
counts requires a joint analysis of the mean abundance 
and spatial distribution. It is also required in that the 
spatial clustering of clusters is the source of sampling 
errors for the counts @. Likewise, cluster power spec- 
trum estimators must account for the non-Gaussian dis- 
tribution of their errors even in the linear regime. This 
fact distinguishes cluster likelihood methods from their 
galaxy counterparts 0, 01 in requiring a full treatment of 
the Poisson shot noise 

Likelihood analyses of the local cluster abundance in 
the literature conversely include the Poisson shot noise 
but omit the power spectrum source of error and infor- 
mation (see e.g. This omission is justified as 
they typically utilize only a handful of the very rarest 
clusters where shot noise dominates and the likelihood is 
Poisson. Fisher matrix studies show that for sufficiently 
deep surveys the likelihood deviates significantly from the 
Poisson limit and that these deviations contain use- 
ful information for self-calibration |14lll5| . For example, 
~ 10 4 high redshift clusters are expected in the upcom- 
ing South Pole Telescope (SPT) survey [2(j , allowing the 
sample to be divided both spatially and in SZ flux. 

In this Brief Report, we develop the likelihood tech- 
niques required to extract this information. We show 
that the joint Poisson-Gaussian likelihood can be reduced 
to a closed-form expression that is applicable to the next 
generation high-z cluster surveys. We begin in iflTl with 



a general description of the likelihood function and spe- 
cialize it to the case of rare high-z clusters in flIIII We 
conclude in illVI 



II. CLUSTER LIKELIHOOD FUNCTION 



Let us take the data to be the number of clusters 
'ip in bins of some observable proxy for mass = 



j) and pixels delineated by angle and redshift 
1, ...,n p ). We will assume that the clusters are 
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Poisson distributed with a mean of m^. This mean fluc- 
tuates from pixel to pixel due to the large scale structure 
of the universe. We will further assume that these pixels 
are sufficiently large that the fluctuations are in the lin- 
ear regime. Denoting the ensemble means with overbars, 
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where 5i is the overdensity within the pixel and 6^ is the 
linear bias of the selected clusters. 

The distribution of Poisson means is then given by a 
Gaussian with covariance |(| 

((m ijU - fh ia ){mj V - fhju)) = m.^ni ,,.!),,, b,,.S, , , (2) 

where 
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Here Wi is the Fourier transform of the pixel window, 
normalized such that J d 3 xWi(x) — 1 and P(k;z i j) is 
the linear power spectrum at the mean redshift of the 
pixels Zij. 

Through TV-body simulations, a cosmological model 
predicts the comoving spatial number density of clusters 
as a function of their mass M (e.g. [i H fl7 l ITl . IT ^ L 
Given a selection in mass p^ (M) and the comoving vol- 
ume of the pixel Vi, the ensemble mean becomes 



m iu =Vi J dlnMpi^M) 
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Likewise, simulations predict the bias of clusters as a 
function of their mass b(M) and hence 

dn 



bin = ~ 
rr, 



[ dlnMp ifi (M)b(M)-^- . (5) 
m v J d In M 

For a completely fixed selection p ifl , the cosmology 
uniquely determines both the set of ensemble mean 
counts ffiiu and their distribution through b ifl for each 



pixel i. This fact allows a measure of "self-calibration" 
of the selection function p^(M) through the relation- 
ships between b^ L and in a given pixel 0, 0] . Note 
that the selection function can include effects such as in- 
strument noise and point source contamination as well as 
the observable-mass relation itself. 

To exploit this information, one can maximize the like- 
lihood of the data over a theory defined by both cos- 
mological and selection parameters. Fisher matrix tech- 
niques estimate that recovery of the dark energy equation 
of state at the ~ 10% level for next generation surveys 
like the SPT is possible, even in the presence of moderate 
uncertainties in the selection function 0, Il4l fl5| . 

In practice, extracting this information will require an 
exploration of the likelihood space. Combining the Pois- 
son likelihood of drawing from riii^ and the Gaussian 
likelihood of drawing from m^, we obtain [1 1| 
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Here and throughout we use vector and matrix notation 
where no confusion will arise. The lower limit on the 
integral is required to prevent negative Poisson means. 

1 /2 

As long as b^S^ <C 1, the likelihood is insensitive to the 
exact value of & max and so we set it to the maximum bias 
for clusters obtainable in the model. In more marginal 
cases, the missing probability from —1 < 5 < — l/6 max 
can be restored to the N — likelihood (see £11111 for 

1 /2 

further discussion). For SJ > 1, the density field itself 
is not Gaussian. 

Since depends on Si through Eqn. (JTJ, the full 
likelihood is a convolution of the Poisson and Gaussian 
distributions. To evaluate this convolution efficiently, let 
us rewrite it as 
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where the generating function 
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FIG. 1: Covariance matrix as a function of pixel size p i x at z = 

1/2 

0.7, Az = 0.1: (a) rms density fluctuation per pixel S,l ; (b) 
correlation Sij /(SaSjj) 1 ^ 2 for neighboring pixels. For (? p i x > 1° 
the covariance matrix is nearly diagonal with negligible correlations 
beyond the nearest neighbor pixel. 



This integral may be transformed into one over a Gaus- 
sian by completion of squares through the diagonaliza- 
tion of the pixel covariance matrix S = ASA T where 
S = diag(er^) and A is an orthonormal matrix of eigen- 
vectors. If the covariance matrix is already nearly diag- 
onal, the integration range will likewise transform to a 
semi-infinite interval such that 

« P 

F(m, b, S, A) = Yl e^«M [1 _ _ E rfc(y fc /V2)] , (9) 

k=l 

where 

Rk = ^ K^fhi^bi^Aik , 

(10) 



Vk = 



-, — — Rk<ik 



The Erfc term appears because of the boundary at — 6~^x 
and must be included for cases where the Poisson and 
Gaussian variance are comparable even if b max &k < 1 as 
required by Eqn. ©. 



III. COARSE PIXEL LIMIT 

Clusters at high redshift are extremely rare objects and 
hence coarse pixelization is required to have a finite ex- 
pectation value filial per pixel. The likelihood function 
simplifies considerably in this limit. 

For definiteness, let us assume that a survey like the 
SPT counts clusters above an observable threshold that 
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corresponds to a mean mass of M t h = IP 14,2 hr 1 Mq with 
a o"inM = 0.25 Gaussian scatter in lnM (see [14J for 
calculational details). We will take as the fiducial cos- 
mology a flat ACDM model with J7a = 0.73, baryon 
density f^/i 2 = 0.024, matter density fl m h 2 — 0.14 and a 
scale invariant initial curvature spectrum with amplitude 
5<; = 5.07 x 10~ 5 corresponding to er 8 = 0.91. Near the 
median of the redshift distribution at z = 0.65 — 0.75, the 
angular number density of these clusters is 0.86 deg. -2 
in this fiducial cosmology and they have a linear bias of 
6 w 4. To have an expectation value of at least 1 cluster 
per pixel, the pixels must be at least 1° x 1° in size. 

The transverse size of the pixel is L plx = D9 plx rj 
31/i -1 Mpc (#pi x /l°) where D is the comoving angular 
diameter distance to redshift z rj 0.7. Since a pixel of 
several degrees on the side receives its fluctuations from 
k modes near the peak of the power spectrum, the spec- 
trum is nearly white and the pixel covariance matrix is 
nearly diagonal. 

To quantify this expectation, let us take square pixels 
on the sky of angular extent 8 pix with a depth Az cen- 
tered around a redshift z. The covariance in the radial 
direction is negligible for Az = 0.1 and so we calculate 
only the angular pixel covariance at each redshift slice. 
The Fourier window function is then 

|W(k)| = j (k x L pbc /2)j (k y L pix /2)j (k z Az/2H), (11) 
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FIG. 2: Likelihood as a function of counts Ni above a mean thresh- 
old mass M t h = 10 14 ' 2 h- 1 Af Q and z = 0.7, Az = 0.1 for (a) 
6> pix = 2° and (b) pix = 4° . Shown are the full likelihood (solid) 
from Eqn. 1131 compared with a Poisson likelihood (long dashed) 
and the m; 3> 1 limiting form in Eqn. 1141 of a convolution of 
Gaussian shot noise and sample variance (dashed). 



where H is the Hubble parameter. The pixel covariance 
matrix becomes 



Sij — 
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(12) 

xlJ is the number of pixels 
separating i and j in the x direction and likewise for Ay^ . 

In Fig.QJ we plot the correlation of the nearest 2 neigh- 
boring pixels. Even for # P j X — 1°, the covariance matrix 
beyond the nearest neighbor is negligible. Therefore Sy 
is extremely sparse and the diagonalization can be ef- 
ficiently implemented. In fact, for 9 piyi > 3° a strictly 
diagonal covariance matrix is a good approximation. 

Now let us examine the likelihood function under this 
diagonal covariance matrix approximation. For sim- 
plicity, we first also assume that there is only one bin 
in observable mass such tha t the s election function is 
Pi(M) = iErfc[ln(M th /M)/ v /2cr 2 nM ], i.e. a smooth func- 
tion of the mass that increases from to 1 across the 
threshold. The likelihood then simplifies to C = n™=i Ci, 
where 



Ci = 



1 
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In Fig. |3 we plot this likelihood as a function of Ni 
and compare it with the Poisson likelihood. Here we take 
two representative cases: f? p i x = 2° where m, = 3.45, 



Oj = 0.097; and 6» pix = 4° where m, = 13.9, er^ = 0.057. 
The full likelihood is broader than the Poisson likelihood 
due to the fluctuating Poisson mean. This excess variance 
can be used to self-calibrate the mass selection through 
the bias in a manner that is stable to the choice of pix- 
elization in this range [T^j . 

This likelihood is normalized such that J2n A(-^i) = 
1 for a fixed model in the limit that biUi <C 1. In practice, 
the finite biUi will mean that there is a finite probabil- 
ity of drawing a Poisson mean of rrii = and hence no 
clusters in the pixel. This probability has been omitted 
m Eqns. ® and (O but may be directly restored to the 
TVj = likelihood. For # pix = 2°, this correction is 0.005; 
for 4°, this correction is a negligible 8 x 10 -6 . 

For the 6> p i X = 4° case, m, 3> 1 and the Poisson distri- 
bution of N around rrii should be nearly Gaussian. In 
this limit, the full likelihood is approximately the convo- 
lution of two Gaussians and becomes 



1 ({Ni-rhif 



(14) 



where s 2 = mi+mfbfaf is the total variance of the convo- 
lution. Fig. [3 shows the accuracy of this approximation. 

Finally, it is instructive to consider the case where the 
Poisson variance is much larger than the Gaussian sam- 
ple variance, i.e. m, 3> mfb^af. In this case the Erfc 
boundary term is negligible and Taylor expanding the 
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likelihood gives a simple closed form 

For the fiducial survey at M th = lO 14 - 2 ^ 1 M , fh^faf = 
0.52 for 6> pix = 2° and 0.76 for 8 pix = 4°. This im- 
plies that the Poisson variance is comparable to the sam- 
ple variance and so this limiting case does not apply. 
For a finer pixelization, this limit can be reached but 
not before the linear theory assumption breaks down. 
Nevertheless, a survey that selects rarer clusters may 
reach this limit in the linear regime. For example with 
M th = IO^/i^Mq, fhi = 4.6 for 6> pix = 4° pixels and 
bi = 4.7, and the approximation deviates by less than 
5% near the peak. It is also useful in understanding the 
qualitative behavior: the Gaussian variance of the Pois- 
son mean m, reduces the likelihood near the ensemble 
mean Ni — fhi and enhance it in the tails. 

Likewise, the Poisson dominated limit is useful in un- 
derstanding the effect of having multiple observable-mass 
bins per pixel. For example with 2 bins 

+b i2 (N i2 ~ m i2 )} 2 - b 2 a N n - b 2 2 N l2 }af) . (16) 

The enhancement of the tails of the distribution occurs 
most strongly for joint fluctuations of Nn and N i2 as 



would be expected from the joint fluctuations of their 
Poisson means run and ma. 

The two cases of a nearly diagonal pixel covariance and 
a Poisson dominated likelihood considered here are the 
ones of practical importance for high redshift clusters. 



IV. DISCUSSION 

We have presented a closed-form expression for the full 
likelihood function for cluster number count surveys in- 
cluding both the Poisson shot noise and Gaussian sample 
variance of the Poisson means from the large-scale struc- 
ture of the universe. We also allow for multiple bins in 
the observable mass per spatial pixel. 

This treatment is especially useful for high redshift 
cluster surveys. Here the spatial covariance is nearly di- 
agonal for pixels that are sufficiently large to have an 
expectation value of more than one cluster per pixel. 
A sparse covariance matrix allows the likelihood to be 
evaluated efficiently. Maximization of the likelihood over 
models for both the cosmology and the observable-mass 
relations will allow future cluster surveys to jointly de- 
termine dark energy and cluster structural parameters. 
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